Sublanguage Dependent Evaluation: Toward Predicting NLP performances
نویسنده
چکیده
In Natural Language Processing (NLP) Evaluation, such as MUC (Hirshman, 1998), TREC (Harman, 1998), GRACE (Adda et al., 1997), SENSEVAL (Kilgariff, 1998), metrics on the performances, such as precision, recall, or f-measure are used. Nevertheless, performance results are often average measurements computed over the complete test. They do not give any clues about the system’s robustness. We conceive evaluations being not only a processs to show how good the systems are on a given dataset, but also as an aid for choosing which system or approach to use to build a NLP application for a specific subset of the language. In this case, knowing which system performs better on average does not help us to find which is the best for a given subset of a language. As a matter of fact, this aspect of the reuse paradigm is rarely investigated in the litterature about workbenches especially designed to adapt quickly to new language resources, such as GATE (Cunningham, 1997), In the present article, the existing approaches which take into account language heterogeneity and offer methods to identify sublanguages are presented. Then we propose a new metric to assess robustness and we study the existence of a correlation between the performance variations observed for POS tagging and the different sublanguages identified in the Penn Tree Bank Corpus. The work we present here is a first step in the development of predictive evaluation methods, intended to propose new tools to help in determining in advance the range of performance that can be expected from a system on a given dataset. keywords : (predictive) evaluation, POS tagging, textual typology, sublanguages, performance variations.
منابع مشابه
Talk the Walk: Robotic NLP vs. Human Sublanguage Acquisition
The paper investigates the appropriateness of natural language processing (NLP) in the context of robotassisted navigation for the visually impaired. Several assumptions of corpus-based robotics are examined. It is argued that, in the short term, NLP may be inadequate and, in the long term, it may not be necessary to enable robotic guides to solicit route directions from bystanders. Human subla...
متن کاملSublanguage Analysis Applied to Trouble Tickets
A feasibility study was conducted to determine whether the sublanguage methodology of NLP could analyze and represent the vital information contained in trouble tickets’ ungrammatical text and to explore various knowledge mining approaches to render the data contained in these documents accessible for analysis and prediction. Experiments showed that the linguistic characteristics of trouble tic...
متن کاملFacilitating post-surgical complication detection through sublanguage analysis
Identification of postsurgical complications is the first step towards improving patient safety and health care quality as well as reducing heath care cost. Existing NLP-based approaches for retrieving postsurgical complications are based on search strategies. Here, we conduct a sublanguage analysis study using free text reports available for a cohort of patients with postsurgical complications...
متن کاملRobotic NLP vs. Human Subset Language Acquisition
This extended abstract investigates the appropriateness of natural language processing (NLP) in the context of robot-assisted navigation for the visually impaired. Several assumptions of corpus-based robotics are examined. It is argued that, in the short term, NLP may be inadequate and, in the long term, it may not be necessary to enable robotic guides to solicit route directions from bystander...
متن کاملLinguistics and Natural Language Processing
Introduction The paper addresses the issue of cooperation between linguistics and natural language processing (NLP), in general, and between linguistics and machine translation (MT), in particular. It focuses on just one direction of such cooperation, namely applications of linguistics to NLP, virtually ignoring for now any possible applications of NLP to linguistics, which can range from provi...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2000